AITopics | fast and robust inference

Collaborating Authors

fast and robust inference

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

BERT Loses Patience: Fast and Robust Inference with Early Exit

Neural Information Processing SystemsDec-24-2025, 16:50:46 GMT

In this paper, we propose Patience-based Early Exit, a straightforward yet effective inference method that can be used as a plug-and-play technique to simultaneously improve the efficiency and robustness of a pretrained language model (PLM). To achieve this, our approach couples an internal-classifier with each layer of a PLM and dynamically stops inference when the intermediate predictions of the internal classifiers do not change for a pre-defined number of steps. Our approach improves inference efficiency as it allows the model to make a prediction with fewer layers. Meanwhile, experimental results with an ALBERT model show that our method can improve the accuracy and robustness of the model by preventing it from overthinking and exploiting multiple classifiers for prediction, yielding a better accuracy-speed trade-off compared to existing early exit methods.

bert lose patience, fast and robust inference, name change, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.79)

Add feedback

Review for NeurIPS paper: BERT Loses Patience: Fast and Robust Inference with Early Exit

Neural Information Processing SystemsAug-16-2025, 15:14:33 GMT

Summary and Contributions: The authors proposes early stopping at test-time to improve inference speed and accuracy. The idea is to train a classifier at each layer of multi-layered embedding model like BERT and perform classification one layer at time, stopping when the prediction stops changing. They demonstrate empirically that the method improves both the speed and accuracy of BERT/ALBERT on the GLUE benchmarks. My opinion of the work remains the same after the response. Strengths: Simple straightforward idea that would be easy to implement directly from the description of the paper and that performs better in some cases than more complicated methods.

accuracy, classifier, end-task accuracy, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

BERT Loses Patience: Fast and Robust Inference with Early Exit

Neural Information Processing SystemsOct-11-2024, 11:21:11 GMT

bert lose patience, early exit, fast and robust inference, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.86)

Add feedback